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Locating sound sources such as prey or predators is critical for survival in many 
vertebrates. Terrestrial vertebrates locate sources by measuring the time delay 

and intensity difference of sound pressure at each ear’ >. Underwater, however, the 
physics of sound makes interaural cues very small, suggesting that directional hearing 


in fish should be nearly impossible®. Yet, directional hearing has been confirmed 
behaviourally, although the mechanisms have remained unknown for decades. 
Several hypotheses have been proposed to explain this remarkable ability, including 
the possibility that fish evolved an extreme sensitivity to minute interaural differences 
or that fish might compare sound pressure with particle motion signals’*. However, 
experimental challenges have long hindered a definitive explanation. Here we 
empirically test these models in the transparent teleost Danionella cerebrum, one of 
the smallest vertebrates”. By selectively controlling pressure and particle motion, 
we dissect the sensory algorithm underlying directional acoustic startles. We find that 
both cues are indispensable for this behaviour and that their relative phase controls 
its direction. Using micro-computed tomography and optical vibrometry, we further 
show that D. cerebrum has the sensory structures to implement this mechanism. 

D. cerebrum shares these structures with more than 15% of living vertebrate species, 
suggesting a widespread mechanism for inferring sound direction. 


Whether it is the song of a whale, the rustling of a mouse or the quiet 
sneaking of a cat, when sound emanates from a source, it travels through 
the surrounding medium as an oscillation of motion and pressure. The 
ability to detect its direction turns hearing into a spatial sense that is 
pivotal for survival. 

Terrestrial vertebrates sense direction by sampling pressure at the 
ears (that is, two positions far apart; Fig. 1a). In the late 1940s, Lloyd 
Jeffress speculated that a combination of delay lines and coincidence 
detector neurons could measure the interaural time difference (ITD). 
Animplementation of the Jeffress model was later found by Carr and 
Konishi in their seminal work on the barn owl brainstem’, making it 
one of the most well-known canonical circuits in vertebrates and a 
textbook example for the convergence of theory, behaviour, biophysics 
and physiology?*. In addition to the ITD, the difference in the pressure 
amplitude across the ears, called the interaural level difference (ILD), 
also carries information about the sound direction. This quantity is 
amplified by the head, which reflects airborne sound and casts an 
acoustic shadow onto the far ear’. 

The vertebrate sense of hearing evolved from acommon fish ances- 
tor, and yet these well-established models for directional hearing falter 
when applied to underwater environments. As sound travels approxi- 
mately five times faster in water than in air, ITDs are reduced to very 
small levels (Fig. 1b). As biological tissues have acoustic impedances 
similar to water, ILDs are substantially smaller, too. This presents a 


conundrum: according to prevailing models, fish should not be able 
to localize sound. Yet, behavioural evidence shows that fishes such as 
the Atlantic cod’®"™, the plainfin midshipman”, herring?” and 
goldfish?* *° can determine the direction of sound sources. 

What cues are available to fish that might enable directional hearing? 
Fish have two distinct peripheral auditory pathways”. First, otolithic 
end organs of the inner ear, in addition to their vestibular function, 
also act as particle motion sensors for nanometre to micrometre dis- 
placements. Owing to the morphology of hair cells, this direct hear- 
ing pathway is inherently spatial, but it allows animals to tell only the 
axis of sound propagation, not the direction—a limitation termed the 
180° ambiguity problem of directional hearing®. The second, so-called 
indirect, hearing pathway relies on the swim bladder, which is filled 
with compressible gas that oscillates in a pressure field. In Otophysi, 
a large superorder containing about 66% of freshwater fish species”, 
aseries of bones (Weberian ossicles) transmits this motion to the 
inner ear? ?,, 

In 1975, Arie Schuijf proposed a model”* suggesting that fish could 
theoretically deduce the direction of sound if they were able to sepa- 
rately sense and compareits motion and pressure components. Alter- 
native hypotheses include the possibility that fish evolved an extreme 
sensitivity to minuscule ITDs and ILDs or use their mechanosensory 
lateral line organ for directional inference through an unknown 
mechanism. Despite almost a century of careful work on this topic 
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Fig. 1| Sounds elicit a directional startle reflex in Danionella cerebrum. 
a, Schematic of a pressure wave, arriving at the auditory organs witha 
detectable ITD in humans. b, ITDs are heavily diminished underwater (value 
approximated for D. cerebrum). c, Behavioural setup (Methods). d, Playback 
paradigm. Before the experiment, sound pressure and particle acceleration 
are calibrated at multiple points inside the inner tank (top left, orange 
crosses; see also Extended Data Fig. 1c-e). Playback is triggered if three 
conditions are met: the fish swims into the trigger zone (top right, dotted 
green rectangle), the fish is oriented <45° to the y axis, and >5 s have passed 
since the last playback (Methods). e, Startles are detected by aspeed 
threshold after sound playback (Methods; see Extended Data Fig. 4 for details 
on startle dynamics). Top: centred trajectories after playback for startles 

(n =1,415) and non-startles (n = 2,383) across all fish (n = 65). Bottom: average 
fish speed for startles and non-startles, aligned to sound trigger at t= O. 

f, Centred startle trajectories in two sound configurations show a directional 
escape away from the left (81% of n - 125 startle trials across 58 fish; two-sided 
binomial test: P = 2 x 10”) or right (79% of n= 115 startle trials across 56 fish; 
two-sided binomial test: P= 2 x 10”) speaker. g, Pooled centred trajectories 
from f with flipped trajectories for the right speaker stimulus summarize the 
directional escape away from the single speaker (80% of n = 240 startle trials 
across 63 fish; two-sided binomial test: P = 1 x 10”). f,g, The heat maps are 
normalized and smoothed two-dimensional histograms over endpoint 
positions of the trajectories (grey for single-stimulus data; blue for pooled 
data). Scale bars, 10 cm (d) and 5mm (f-g). 
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(Supplementary Table 1), starting with Karl von Frisch’s studies on min- 
nowsin1935*9, the mechanism of directional hearing in fish is debated 
to this date. 

Empirical tests of the directional hearing mechanism have been 
fraught with several difficulties. First, reliably controlling sound stimuli 
underwater is challenging owing to echoes and sound reverberations, 
as well as near-field effects in small tanks". Second, although fish 
are sensitive to particle motion, many studies control only for pres- 
sure, So several authors have urged experimenters to control both 
guantities*!***/ Third, the sense of hearing, mediated by the inner ear, 
must be distinguished from the lateral line sense, which is sensitive to 
low-frequency water flow (<200 Hz)***. Fourth, behavioural paradigms 
should exclude the possibility of klinotaxis (that is, a sequential gradi- 
ent sampling strategy) instead of true directional hearing. Finally, the 
study of sound transduction to the fish’s inner ear?***?” is complicated 
by tissue opacity. 

Here we address these challenges and systematically test hypoth- 
eses of directional hearing mechanisms using the transparent fish 
Danionella cerebrum, one of the smallest known vertebrates?!99*759, 
Its small size makes D. cerebrum well suited for high-throughput experi- 
ments under controlled laboratory conditions. Moreover, D. cerebrum 
communicate acoustically, underlining the importance of hearing 
for their behaviour. With an inner ear separation of less than 1mm, 
D. cerebrum put interaural comparison mechanisms to their ultimate 
test (Fig. 1b). 

We find that D. cerebrum perform directional startles away froma 
sound source and that this ability is independent of the lateral line. We 
then present an extensive set of controlled particle motion and pressure 
stimuli that lead to differential predictions for directional responses, 
depending on seven alternative hypotheses for the mechanism of 
directional hearing. Finally, we carry out laser-scanning vibrometry 
of auditory structures across the transparent body to determine the 
physical cues available to D. cerebrum. Together, the findings of our 
experiments lead us to reject all but one of the proposed hypotheses 
for directional hearing, and they provide strong support for Schuijf’s 
model that fish compare the phase between pressure and motion to 
tell the direction of a sound source. 


Directional startle responses 


To test whether D. cerebrum can hear sound direction, we tracked their 
motion in an aquarium surrounded by underwater speakers (Fig. 1c, 
Methods and Extended Data Fig. 1a,b). We played back transient 
sounds (about 12 ms duration, about 0.7 ms rise time, 780 Hz centre 
frequency; Methods and Extended Data Figs. 2 and 3) and quantified 
the direction of their startle reflex. The experiment was carried out 
with one fish at a time, and playback depended on the fish’s orienta- 
tion and position, to test left-right directional hearing and cancel 
position-dependent echoes (Fig. 1d and Methods). 

Shortly after sound onset (within17 ms or 2 video frames), D. cerebrum 
performed a characteristic startle reflex involving fast sideward dis- 
placement (see Fig. 1d,e, Methods and Extended Data Figs. 4 and 5 for 
startle dynamics, probabilities and habituation). We used the relative 
displacement along the left-right speaker axis 50 ms after startle onset 
asa readout of directional response and found that D. cerebrum startle 
away from the speaker, irrespective of the speaker location (left or 
right; Fig. 1f). Consequently, left and right responses were pooled in 
ametric for directional escape away from the speaker (Fig. 1g). These 
single-speaker sound playbacks resulted in directional escapes for 
80% of all startles (191 of n = 240 startles across 63 fish with at least 
one startle, two-sided binomial test: P= 1 x 10”), an effect present in 
both sexes (Extended Data Fig. 6a). A replication of the experiment in 
lateral line-ablated fish showed equivalent results of directional hear- 
ing (Extended Data Figs. 7b and 9b). Thus, female and male D. cerebrum 
exhibit directional hearing independent of lateral line function. 
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Fig. 2| The relative phase between pressure and particle motion predicts 
startle direction. a, Schematic of D. cerebrum hearing apparatus anatomy. 

b, Schuijf’s model: a plane wave from the left differs froma plane wave from the 
right in terms of the phase relationship between pressure and directed particle 
velocity. c, Illustration of stimulus configurations. Plus, minus and arrows 
inside speaker symbols refer to speaker signal polarity. L, left speaker; R, right 
speaker; P, only pressure; M, only motion. The schematics of resulting traces 
illustrate the pressure and particle velocity relationship (see Extended Data 
Figs. 2 and 3 for actual traces). Speaker schematics show a simplified 
configuration. Active echo cancellation typically involved three speakers 
(Supplementary Table 2). d-f, Centred startle trajectories after sound 
playback. All statistical tests are two-sided binomial tests. d, Directional 


Physical cues and hypotheses 


We then inquired about possible algorithms that may underlie this 
directional hearing behaviour. As directionality has to be inferred from 
physical acoustic cues, we gathered those cues potentially available 
to D. cerebrumto list compatible hypotheses for a directional hearing 
algorithm. Adults have an inner ear separation (about 0.6 mm; Fig. 2a) 
that is orders of magnitude smaller than the wavelength of the sounds 
recorded in their natural habitat (=150 mm for sounds upto 10 kHz). In 
addition, the mismatch between the characteristic acoustic impedance 


escapes away from a single speaker for a positive polarity signal (same data as 
Fig. 1g) and negative polarity signal (80% escapes, P=7 x10?,n=258startle 
trials to this sound in 61 fish). e, Absence of significant directional bias inthe 
positive polarity condition with only pressure (44% of n=90 to right, not 
significant; P> 0.05) and only particle motion (46% of n=192 to right, not 
significant; P> 0.05). f, Single-speaker playbacks pooled over both polarities 
evoke a directional escape (80% escapes, P- 4 x 10°, n= 498 startle trials in 
64 fish). Selective inversion of pressure polarity by an additional pair of 
speakers along the orthogonal axis inverts the relative polarity between 
pressure and particle velocity, which tricks the fish into performing startles 
approaching the active speaker (67% approach, P-1x107, n=331startle trials 
in 61 fish). Scale bars, 500 um (a) and 5 mm (d-f). 


of water (1.5 MPa s m”) and biological tissue (about 1.6-1.7 MPa s m'!)69 
is small, unlike the approximately 4,000-fold mismatch between bio- 
logical tissue and air (0.0004 MPa s m”). Hence, D. cerebrum can hardly 
break left-right symmetry by casting an acoustic shadow. However, 
close to a monopole sound source, ILDs could occur owing to the 
steep decay of the sound field with distance (Extended Data Fig. 8a). 
In D. cerebrum, the level drop between both inner ears at 3 cm distance 
from a monopole sound source can be as large as about 2% for pres- 
sure, irrespective of freguency, and about 496 for particle velocity at 
frequencies below 4.2 kHz. Hence, sound direction could be inferred 
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through two pressure sensors (P-ILD; hypothesis 1) or two particle 
motion sensors (M-ILD; hypothesis 2). These two strategies would 
work best for nearby sounds coming from a direction along the axis 
of the sensor pair. 

The other binaural cue may be time differences in pressure (P-ITD, 
hypothesis 3) or particle motion (M-ITD, hypothesis 4). The maximal 
ITD between D. cerebrum's inner ears is 0.4 us, orders of magnitude 
smaller than in terrestrial vertebrates*. An ITD mechanism would thus 
point to an extreme sensitivity to minutetime differences. 

Each inner ear hair cell deflects along a preferred axis. Continu- 
ous sinusoidal sounds from opposing directions would stimulatethe 
same hair cells, leading to the 1809 ambiguity problem. If, however, 
all ecologically relevant sounds started with compression, theanimal 
could interpret initial particle motion as motion directed away from 
the source. This sense would reguire just a single particle motion sen- 
sor for each axis. We call this possibility M-polarity hearing for either 
positive or negative polarity (hypotheses 5 and 6). 

Finally, Schuijf’s model for directional hearing resolves the 180° 
ambiguity problem by using pressure as a reference guantity (hypoth- 
esis 7). Theidea is most easily illustrated for a plane wave (Fig. 2b), 
but its applicability is not limited to it (Extended Data Fig. 8b): atthe 
phase of high compression, particles move away from the source, no 
matter if it started with push or pull. Thus, the pressure signal can act 
as a reference signal to resolve the 180° ambiguity of particle motion. 

To distinguish between these hypotheses, we took advantage of 
D. cerebrum's small size and precise control over stimuli under labora- 
tory conditions. This allowed us to create stimuli that would normally 
not occur in nature (for example, sounds with pure pressure and no 
particle motion component) and to behaviourally dissect the algorithm 
used by D. cerebrum to tell sound direction. 


The directional hearing algorithm 


The observed directional startle responses (Fig. 1f,g) are a reaction to 
naturalistic two-component sound consisting of pressure and particle 
motion signals (Fig. 2c(i)). To investigate how the isolated components 
of this sound field affect D. cerebrum, we created a pure pressure stimu- 
lus and a pure particle motion stimulus. The pure pressure stimulus 
was realized by driving opposing speakers in phase (Fig. 2c(iv)), and 
the pure motion stimulus was realized by driving them out of phase 
(Fig. 2c(v)), creating standing waves with nodes at the animal’s loca- 
tion. Our approach additionally took into account echo cancellation 
and the spatially mapped frequency response of our recording tanks 
to present controlled and reproducible stimuli (Methods and Extended 
Data Figs. 2 and 3). We found that either component alone (pure pres- 
sure or pure motion) can elicit startles (Extended Data Fig. 7). However, 
neither component by itself elicited directionally biased responses, 
suggesting that both are necessary to tell sound direction (Fig. 2e and 
Extended Data Fig. 7). 

The unbiased responses to the pure particle motion stimulus were the 
first evidence against a directional hearing algorithm based on initial 
particle motion polarity (M-polarity). To further test this hypothesis, 
we presented amplitude-inverted waveforms, for which the M-polarity 
algorithm would predict a reversed startle response. However, 
D. cerebrum performed startles away from the speaker for both polari- 
ties (Fig. 2d; pooled single-speaker data in Fig. 2f: 80% escape of n = 498 
startles across 64 fish, two-sided binomial test: P= 4 x 10°*7). Hence, 
we ruled out particle motion polarity (M-polarity) as the sole cue for 
directional hearing. 

To test whether it may be the relationship between pressure and 
motion that determines the startle direction, we selectively inverted 
the pressure signal while leaving particle motion unchanged (similar 
to refs. 8,14). As we have seen, activating the left speaker (for exam- 
ple) evokes startles towards the right, away from the speaker, for 
both positive or negative waveform polarity (configurations shown 
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in Fig. 2c(i,ii)). By introducing an additional pressure source through 
in-phase activation of two speakers orthogonal to the left-right axis 
(Fig. 2c(vi)), we were able to selectively invert pressure, creating stimuli 
with pressure and particle motion cues akin to a sound originating 
from the right (Fig. 2c(iii)), despite the right speaker being inactive 
(Methods). In this ‘trick condition’, Schuijf’s model predicts reversed 
startle behaviour (that is, ‘escape’ towards a speaker). 

Indeed, D. cerebrum could be tricked: following pressure inversion, 
D. cerebrum performed startles towards the active speaker rather than 
away (Fig. 2f, 67% approach, two-sided binomial test: P=1* 10°’, n = 331 
startle trials in 61 fish). This result held true for both sexes, within indi- 
vidual fish, for both pressure polarities and in lateral line-ablated fish 
(Extended Data Figs. 6b-d, 7 and 9d). 

To check whether any binaural mechanism explains startles towards 
the speaker in the trick condition, we estimated the amplitudes and 
signs of P-ITD, M-ITD, P-ILD and M-ILD on the basis of theory and 
pressure measurements on both sides of the fish (see the section of 
the Methods entitled Estimation of binaural cues). We found that the 
signs of M-ITD and M-ILD remain unchanged when creating the trick 
condition and cannot explain the reversal of D. cerebrum's escape direc- 
tion. This suggests that neither interauraltimenor level comparisons 
are sufficient toinfer sound direction from particlemotion. 

In the single-speaker experiments, the distance-dependent decay 
of absolute pressure is 4.4 Pa over 600 um, which is 2% ofthe pressure 
amplitude and potentially large enough to be detected. When we selec- 
tively invert pressureto realizethetrickcondition, wealso effectively 
invert the sign of the level gradient along the horizontal x axis, as well as 
the sign of the phase delay (see Extended Data Fig. 8d for ageometrical 
explanation). Therefore, P-ILD and P-ITD are the other mechanisms 
that are in agreement with the behavioural response of D. cerebrum. 

In summary, D. cerebrum can be tricked into startling towards the 
speaker rather than away (see also Supplementary Video 1). Among 
all seven hypotheses considered, this behaviour is consistent with 
only three (Fig. 4): Schuijf’s model, which relies on the phase compari- 
son between pressure and particle motion; and the P-ILD and P-ITD 
mechanisms, which both rely on sensing pressure level at two positions 
inspace. 

To determine which of these remaining three hypotheses is com- 
patible with D. cerebrum’s sensory anatomy, we next asked whether 
D. cerebrum possess sensory organs that can detect pressure and par- 
ticle motion, as required by Schuijf’s model, or sensory organs that 
can detect a difference in pressure amplitude along the azimuth, as 
required by the P-ILD and P-ITD mechanisms. 


The hearing apparatus 


On the basis of micro-computed tomography (micro-CT) and optical 
vibrometry, we characterized D. cerebrum’s auditory organs to narrow 
down the candidate mechanisms for directional hearing. We started 
by visualizing the anatomy of D. cerebrum’s hearing apparatus with 
micro-CT imaging of a phosphomolybdic acid-stained sample (Meth- 
ods). We segmented the main components of the hearing apparatus 
suchas the swim bladder, the labyrinths, the otoliths (lapillus, asteriscus 
and sagitta) and otolithic end organs (utricle, lagena and saccule), the 
ossicles of the Weberian apparatus (including tripus and scaphium), 
and the lymphatic chambers that engulf the lagena and the saccule 
(Fig. 3a and Supplementary Video 2). 

To study the motion of these auditory structures in response to the 
sound stimuli, we built an imaging vibrometer based on laser-scanning 
confocal reflectance microscopy: by time-gated acquisition of each 
pixel with respect to a continuous sinusoidal sound playback, it was 
possible to infer the relative phase of the motion of auditory structures 
in two dimensions (Fig. 3b, Methods and Extended Data Fig. 10). 

First, we investigated whether the hearing apparatus of D. cerebrum 
is capable of detecting pressure. To this end, we drove two opposite 
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Fig.3|TheD. cerebrum hearing apparatus can detect pressure and particle 
motion separately. a, Segmentation of the hearing apparatus based on 
micro-CT. The fields of view for vibrometry phase maps in c-e are indicated. 

b, Illustration of the vibrometry method. A laser-scanning confocal reflectance 
microscope is used to image the motion of auditory structures ina sound field. 
Each pixel is sampled at four sound phases to record sound-induced motion in 
thex-y plane. See Methods and Extended Data Fig. 10 for details. c-e, Particle 
velocity phase (g) and amplitude (A) maps for motion along the left-right 
speaker axis; phase colour wheel shown in b, normalized to the amplitude A max 
indicated above the maps. c, A pressure stimulus causes anti-phase motion of 
the tips of the tripus and scaphium along the medial-lateral axis (indicated 
with red and cyan arrows). d, Along the dorso-ventral axis, a pressure stimulus 
creates phase-lagged motion of the scaphium and sagitta with the surrounding 
tissue. e, A particle motion stimulus results in relative motion between the 


speakers in phase, thereby creating a ‘pressure-only’ stimulus. In this 
condition, we observed anti-phase motion of the left and right tripus 
and scaphiumalong the medial-lateral axis, periodically compressing 
lymphatic space (Fig. 3c,d). This motion stems from swim bladder 
compression and expansionina pressure field. In addition, we observed 
rotational motion of the sagitta, part of the saccular end organ that lies 
embedded in an adjacent second lymphatic space (Fig. 3d). By contrast, 
we did not observe relative motion between the surrounding tissue and 
the lapillus or the asteriscus (Extended Data Fig. 11). We concluded that 
an indirect pressure sensing pathway exists in D. cerebrum and that 
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lapillus and surrounding tissue along the medial-lateral axis. Pooling over pixels 
within the region of interest, the displacement amplitude of the surrounding 
tissue is 1.24 um £0.14 um, and that of the lapillus is 0.95 um + 0.05 pm. Ata 
relative phase of 0.14 1 + 0.05 n, the relative displacement can be as large as 
0.56 um (that is, 45% of the surrounding tissue motion). f, Indirect pathway for 
pressure sensing: in a pressure field, the swim bladder oscillates, which moves 
the tripus and the scaphium inwards and outwards. Lymphatic spaces probably 
couple the motion of the scaphium to thesagitta/?”*. g, Direct pathway for 
particle motion sensing: the tissue of the fish couples to the particle motion 

of water, but the denser lapillus lags in phase and moves with lower amplitude, 
leading to a relative displacement that could be sensed by utricular hair cells 
underneath the lapillus. Observations shown in c-e were repeated once, four 
times and once in other fish, respectively, with similar results. Arrows inf,g 
indicate direction of motion. Scale bars, 250 um (a) and100 pm (c-e). 


the saccule may be its main end organ, in agreement with findings in 
other fishes?®?°*!, 

Second, we studied particle motion sensing by driving two opposite 
speakers in anti-phase. In this condition, with large particle motion but 
low-pressure signal, the lapillus, the otolith of the utricular end organ, 
moved witha phase lag (about 0.14 tt + 0.05 1) and at a lower amplitude 
(about 76% + 10%) than the surrounding tissue (Fig. 3e). This creates a 
relative motion that is expected to stimulate the underlying hair cell 
epithelium (direct pathway). We did not detect such relative motion for 
the tripus, the sagitta and the asteriscus. Note that we used stimuli with 


Nature | www.nature.com | 5 


Article 


i ii and iii iv v vi 
Single Single Pressure Motion Trick 
speaker + speaker — 
A 
a & & Pa | g 
Y w Consistent 
Model Reguired sensors Predictions for directional behaviour with data 
LL Lateral line x 
M-ILD 2 particle motion = = e> WE ra x 
M-ITD 2 particle motion r a >l <> En x 
M-polarity (+) 1 particle motion rr ie <>! eS re x 
M-polarity (-) 1 particle motion - m |si = x 
P-ILD 2 pressure > = ex a> - x 
P-ITD 2 pressure > > — <>| <- x 
Schuijf's model 1 Particle motion 
U and 1 pressure di a | >i as v 
Sensors found Directional behaviour (data) 
22 particle motion 
> <>| <>] < 
and 1 pressure 


Fig. 4 | Evidence for Schuijf’s model of phase comparison. We consider seven 
models for directional hearing that depend on different sensory structures and 
predict different behaviours (an eighth lateral line (LL)-based mechanism can 
be ruled out as we observe directional behaviour despite lateral line ablation). 
Interaural time difference (M-ITD) and level difference (M-ILD) mechanisms 
based on particle motion can be rejected on the basis of the behavioural data 
inthe trick configuration. A strategy that is based on escaping positive 
(M-polarity (+)) or negative (M-polarity (—)) initial motion can be rejected as 
inverted polarity waveforms fail to invert the startle direction. Finally, a 
mechanism based on sensing pressure level or time differences (P-ILD, P-ITD)— 
consistent with behavioural data—can be rejected as D. cerebrum possess only 
asingle pressure sensor. This leaves Schuijf’s model as the one that correctly 
predicts an inversion of startle direction in the trick configuration and that is 
based on sensory cues that D. cerebrumis able to sense. 


particle motion along the mediolateral axis relevant for the left-right 
startle behaviour. To support directional hearing in three dimensions, 
D. cerebrum may have further direct motion sensing pathways along 
additional axes, in line with particle motion tuning in saccular$*** or 
lagenar®® afferents in other species. In summary, we did not find two 
pressure sensors that could detect a pressure level difference (P-ILD) 
or time difference (P-ITD) along the binaural axis, but rather a single 
pressure sensor (Fig. 3f) and aset of particle motion sensors (Fig. 3g). 

We consider whether D. cerebrum could have another pressure sen- 
sor that the vibrometry measurements did not detect. Like in other 
cyprinids, their swim bladder is divided into an anterior and a posterior 
part. To rule out D. cerebrum using a pressure difference between these 
divisions, we repeated our behavioural analysis for only those startles 
in which the anterior—posterior axis of the fish was near-orthogonal 
to the axis of sound presentation, giving equivalent results (Extended 
Data Fig. 12a). Theoretically, D. cerebrum might possess other, unknown 
pressure sensors to implement P-ILD or P-ITD. However, all known 
sound pressure sensors in fish are based on compressible gas-filled 
organs. As gas-filled structures have a high micro-CT contrast, hypo- 
thetical pressure sensing organs would have to be either microscopic to 
evade detection, or based on an unknown principle of sound pressure 
transduction without compressible gas. Neither of these options is sup- 
ported by our current knowledge of fish biology and physics of sound. 

We therefore reject P-ILD and P-ITD as plausible mechanisms. Instead, 
the D. cerebrum anatomy is well suited for implementing Schuijf’s model 
for directional hearing. 


Discussion 

In this study, we report direct experimental evidence for directional 
hearing in fish and identify its underlying biophysical mechanism. 
Directional startles were acoustically elicited in male and female D. cer- 
ebrum, irrespective of lateral line ablation. A selective inversion of 
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the pressure component tricked D. cerebrum into performing a star- 
tle towards the active speaker rather than away, consistent with the 
hypothesis that fish compare pressure and particle motion signals 
to infer direction (Schuijf’s model). On the basis of anatomical and 
behavioural data, we rejected all known alternative models, including 
binaural models for directional hearing. Using optical vibrometry, 
we confirmed the existence of a direct pathway for particle motion 
sensing and an indirect pathway for pressure sensing in D. cerebrum. 
Hence, the D. cerebrum hearing apparatus supports a dual sense of 
hearing that allows for a comparison between pressure and particle 
motion. Together, these findings suggest that Schuijf’s model is actu- 
ally implemented in nature (Fig. 4). 

Our work builds ona large body of pioneering work on fish hearing, 
summarized in Supplementary Table 1. To provide an overview of past 
evidence for directional hearing in fish, we categorized publications 
into five study types. We indicated to what degree authors controlled 
acoustic variables and whether they could judge their importance to 
directional hearing. These previous studies faced a trade-off: they were 
carried out either in open water, in which reverberations are negligible, 
but experiments are challenging so that only a few fish were tested“, or 
inthe laboratory, in which more fish could be tested but pressure and 
particle motion were not fully controlled, and the lateral line function 
could not be ruled out as a near-field sense that aids directional hear- 
ing. Here we addressed these limitations by measuring the impulse 
response of our speakers, actively cancelling reverberations and pre- 
cisely controlling natural and unnatural stimuli. 

Hearing in humans refers to the perception of pressure oscillations. 
We have shown that D. cerebrum instead have a dual sense of hearing 
comprising pressure and particle motion sensing, which is used for 
directional hearing. It could be that the acoustic world is much richer 
for fish than for humans, with acoustic events carrying stereotypic dual 
pressure-motion signatures’ that may even reveal their distance*. 
As D. cerebrum is a vocal species, directional hearing may also have a 
social function. 

Schuijf’s model has recently been extended through a proposal that 
fish compute the time-averaged product of pressure and motion (the 
acoustic intensity vector)*. This theory can account for phonotaxis of 
plainfin midshipmen towards monopole and dipole sources!”!*, Future 
neurophysiological work may test whether D. cerebrum implements 
Schuijf's model this way. 

D. cerebrum shares the Weberian apparatus with other otophysans, a 
superorder that includes 66% of living freshwater fish species and 15% 
ofall living vertebrate species”. Even in fishes lacking the Weberian 
apparatus, otolithic end organs may still inherit pressure-induced 
swim bladder motion?$*27!72, Hence, Schuijf’s model may have wide- 
spread applicability. 
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Methods 


Animals 

All animal experiments conformed to Berlin state, German federal and 
European Union animal welfare regulations and were approved by the 
LAGeSo, the Berlin authority for animal experiments. D. cerebrum were 
kept in commercial zebrafish aquaria (Tecniplast) with the following 
water parameters: pH 7.3, conductivity 350 uS cm, temperature 27 °C. 
We used male and female adult fish between 4 and 11 months of age. 


Behavioural setup and protocol 

The experimental setup comprised an inner 10 cm x 10 cm 
(length x width) tank with <200 pm thin optically opaque but acousti- 
cally transparent polypropylene sheet walls (cut out of plastic folders), 
surrounded by an outer tank with submerged speakers (4 x 3 Ekulit 
LSF-27M/SC 8 Q in custom waterproof enclosures). Thus, the speakers 
were visually shielded from the fish inside the inner tank, and the fish 
were confined between the speakers (Fig. 1c provides a schematic; 
further details are provided in Extended Data Fig. 1a,b). The height of 
the water was 10 cm, and the transparent bottom of the inner tank was 
at 6.3 cm, leaving 3.7 cm to the water surface as a water column for the 
fish to swim in. The speakers were level with this water column, and all 
sounds were targeted for this water column. Infrared light-emitting 
diodes illuminated the fish from below. The inner tank was filmed with 
an overhead camera at 120 fps at 336 x 336 pixel resolution, and live 
tracking of the fish was carried out ona subset of frames at 15 fps. White 
light-emitting diodes lit the setup indirectly via reflections from the 
room walls. The room and water temperature was kept at 27 °C. 

Each fish was tested once, and one fish was tested at atime. Inthe first 
minutes of the recording, a10 cm x 10 cmacrylic plate with centimetre 
markings was placed in the inner tank to match the sound calibration 
grid to the video frame. Three minutes after placing the fish in the inner 
tank, playbacks were triggered for 45 min. 

To probe into left-right directional hearing, playbacks from the front 
or back of the fish should be avoided. We prompted D. cerebrum to align 
with respect to the left-right speaker x axis to increase experimental 
throughput: previously, we observed that D. cerebrum swim closer 
to white than to black walls. By using two black plastic films as walls 
across the x axis and two white films across the y axis, we encouraged 
D. cerebrum to oscillate between the white walls, along the y axis 
(Extended Data Fig. 1c). Consequently, the ratio of distance covered 
along they axis to the distance covered along the x axis was 1.6. Sound 
playback occurred only when fish were orthogonally oriented withina 
45° angle measured from the orthogonal axis and within a1.5 cm x 3 cm 
trigger zone at the centre, leaving at least 3.5 cm distance to the near- 
est wall (the typical startle displacement is mean + s.d. = 1 cm + 0.4 cm 
after the first 50 ms). The minimal delay between playbacks was set to 
>5 s with a minimum delay of 5 s plus a random delay, drawn from an 
exponential distribution with a mean of 5 s for each trial. This paradigm 
averaged to about 1.3 playbacks per minute in untreated fish and to 
about 0.6 playbacks per minute in lateral line-ablated fish. 

Twelve target sounds were generated from a recorded pressure wave- 
form (see the section of the Methods entitled Sound stimulation wave- 
forms), targeted to the fish’s current position to cancel reverberations 
(see the section of the Methods entitled Calibration and reverberation 
cancellation), and presented to the fish in random order following 
trigger events using custom-written code in Python 3. 

The data in Figs. 1-4 and Extended Data Figs. 4-7 and 12a stem from 
65 untreated fish (3,798 playbacks, 1,415 startles, about 37% startles). 
For eachstimulus, we indicated the number of fish that responded with 
at least one startle. The same experiment, also comprising 12 sound 
configurations, was repeated with 74 lateral line-ablated fish (Extended 
Data Figs. 7 and 9; 2013 playbacks, 910 startles, 45% startles). A third 
sound playback experiment was carried out in the dark in 43 untreated 
fish, testing a subset of 4 sound configurations (Extended Data Fig. 12b). 


Behavioural analysis 

Tracking. Pose tracking of D. cerebrum’s swimming behaviour was 
carried out with SLEAP®. In total, 140 frames across nine random re- 
cordings of male and female fish were hand-labelled with a skeleton 
consisting of 7 equidistant nodes along the fish’s body segments and 
2 additional nodes, 1 for each eye. The ‘single-animal’ model was used 
for training. The model parameters and the trained model are available 
at the G-Node repository (see Data availability). 


Startle detection. Plotting the fish’s velocity against time around play- 
back revealed asharp increase in velocity after playback, clearly visible 
across all playbacks (Fig. le and Extended Data Fig. 4a). We defined 
a 25-ms time window around the time of peak velocity at which the 
speed distribution is bimodal and computed the average velocity in 
thistimewindow for eachtrialto classify allplaybacktrials with an aver- 
age velocity above 17 cms “as startle trials (Extended Data Fig. 4b,c). 
The remaining ones were classified as non-startle trials. The decision 
criterion based on speed also resulted in a clear separation in terms of 
body bend (Extended Data Fig. 4d). 


Directional bias. To classify startles into left or right, we measured 
the fish’s x displacement during the first 50 ms after startle initiation 
(Fig. le). This duration was chosen because displacement heat maps at 
varying delays revealed that the initial, lateral displacement phase of the 
startle response peaks after 50 ms (Extended Data Fig. 4f). Wherever 
centred trajectories are shown, these initial 50 ms are depicted. The 
directional bias of the startle response is the fraction of startles to one 
indicated direction (left or right, away or towards speaker). This bias 
can be computed in two ways. 

Directional bias across trials and fish. For each stimulus or set of stimuli, 
startle trials were pooled across all fish, and the fraction of startles in 
one direction was calculated. Using the two-sided binomial test, we 
calculated how likely a measured directional bias (approach or escape) 
would have been observed if the response was unbiased. 
Directional bias per fish. In the analysis of bias across trials and fish, 
theoretically, all trials could stem from one performing animal (not 
of concern here; Extended Data Fig. 5a,b). To complement this mea- 
sure, we also quantified the directional bias per fish. We had 12 sound 
configurations in each experiment and startles averaged to a total of 
about 22 startles per fish in an experiment; hence, a meaningful per-fish 
bias could be computed only on pooled sound configurations and for 
fish with many startles. To estimate the directional bias of individual 
fish, we filtered for fish that had 210 startles in both the single-speaker 
condition (pooled over 4 stimuli) and the trick condition (pooled over 4 
stimuli; Extended Data Fig. 6c,d). Although the value reflects directional 
behaviour in the population and estimates fish-to-fish variability, it 
selects for fish that trigger many playbacks and startle often. 


Micro-CT 

A 12-month-old male wild-type D. cerebrum was euthanized by ice 
shock and fixed with 4% paraformaldehyde in phosphate-buffered 
saline (PBS) at 4 °C overnight. The next day, the fish was washed for 
15 min in PBS before being stained with 5% phosphomolybdic acid 
(Sigma Aldrich) solution in PBS at 4 °C overnight. After staining, the 
fish was washed in PBS for 15 min before embedding in 1% PBS-buffered 
agarose inside a cryo tube. The micro-CT scan was carried out at the 
ANATOMIX beamline at SOLEIL synchrotron by XPLORAYTION. The 
sample was placed into a 40-keV polychromatic (white) X-ray beam. 
A scan consisted of 3,200 projections collected at about x10 optical 
magnification by a digital camera (Orca Flash 4.0 V2) with a sensor pixel 
size of 6.5 um at 150 ms exposure time, yielding an effective pixel size 
of 0.6485 um. The registered data were binned to 1.2970 um voxel size. 
Key structures of the hearing apparatus were manually segmented. 
To this end, planes were hand-labelled using 3D Slicer” (v5.6, https:// 


slicer.org) and then interpolated using Biomedisa (v23)”. FIJI Image] 
(v1.5) was used to convert between different file types. The segments 
were turned into mesh grids and loaded into Blender for cleaning and 
rendering. 


Lateral line ablation and DASPEI staining 

To rule out that the lateral line organ senses sound directionality in our 
experiments, we ablated the lateral line using neomycin”. To ablate the 
neuromasts, fish were placed in a 200 uM neomycin solution for about 
30 min. Afterwards, they were transferred to a beaker with tank water. 
Behavioural experiments started after 230 min. To confirm the reliabil- 
ity of the lateral line ablation protocol, we stained 30 neomycin-treated 
fish. After the behaviour experiment, they were transferred to a100 uM 
DASPEI (2-[4-(dimethylamino)styryl]-1-ethylpyridinium iodide) solu- 
tion and then to a beaker with tank water to wash out unbound DASPEI. 
Afterwards, the fish were euthanized with an ice shock and imaged with 
an epifluorescence microscope. Neuromasts were reliably stained in 
control fish but not inneomycin-treated fish, indicating reliable abla- 
tion (see Extended Data Fig. 9e,f for example images). As functional 
metrics we report an increase in number of wall contacts after startles 
(Extended Data Fig. 9g) and a decrease in foraging strikes in the dark 
(Extended Data Fig. 9h) in neomycin-treated fish. 


Vibrometry 

Confocal microscope. The confocal reflectance microscope was 
based on a custom-built laser-scanning two-photon microscope (Ex- 
tended Data Fig. 10a). The illumination source was a Ti:sapphire laser 
(MaiTai DeepSee; SpectraPhysics) operated at 810 nm (with or without 
mode-locking). Before entering a laser-scanning two-photon micro- 
scope, the beam passed througha 90:10 beam splitter (90% reflection, 
10% transmission). The light back-scattered by the fish inner structures 
was descanned, reflected by the 90:10 beam splitter, and then focused 
by a lens (f= 50 mm) into a single-mode fibre (core diameter: 25 um, 
numerical aperture: 0.1) acting as a confocal pinhole. The microscope 
was controlled by custom-written software (https://github.com/dan- 
ionella/Ismaq). 


Acoustic stimulation. Fish were anaesthetized in 120 mg!" fish 
water-buffered MS-222. They were subsequently placed on a pre- 
formed agarose mould, which allowed the gill covers to move freely, 
and immobilized with 2% low-melting-point agarose (melting point 
25 °C). A flow of aerated aquarium water (with anaesthetic) was deliv- 
ered to their mouth through a glass capillary. 

The fish was acoustically stimulated using two facing speakers sealed 
in custom-made waterproof enclosures. The diaphragms were exposed 
to water. The speakers were each placed about 1.3 cm away from the fish. 
They were driven using a DAQ card (National Instruments USB-6211), 
connected through audio amplifiers (Kemo MO31N, 3.5 W). Pressures 
of up to about 176 dB (referenced to 1 Pa) were thus generated at the 
fish position in the pressure-only configuration and particle motion of 
upto about 8 mm s''intheparticle-motion-only configuration, consist- 
ent with the expected amplitude relationship between pressure and 
particle motion in the sound monopole near field. 


Motion phase maps. The principle of the laser-scanning vibrometric 
measurement is illustrated in Fig. 3b and Extended Data Fig. 10b. The 
sample (Extended Data Fig. 10b(i)) was stimulated with an acoustic 
sinusoidal wave at frequency f iny and imaged with a laser-scanning 
microscope witha line rate ian (Extended Data Fig. 10b(ii)). 

To reconstruct amplitudes and relative phases of sinusoidal object 
motion, we needed to measure each pixel under more than two differ- 
ent phases according to the Shannon-Nyquist sampling theorem. As 
noise can influence this measurement, we used four phase steps here, 
ensuring proper phase reconstruction while keeping acquisition ses- 
sions reasonably short. 


To reconstruct the displacement of the moving structures inside the 
fish, each line of the image was repeatedly scanned nStep = 4 times, 
witha phase offset of 1/2 between each line (Extended Data Fig. 10b(iii)). 
To this end, the stimulation frequency and the line rate must follow 
the relationship: 


fetim = (N+ Y/nStep)fean 


with N being an integer. To maximize the line rate, we took 
N= floor (Krim can): 

This in turn set additional constraints on the various scanning 
parameters. We used Ícan = 800 Hz and Stim = 1,000 Hz for the data 
presented in Fig. 3 and Extended Data Fig. 11. 

To ensure repeatable measurements, the acoustic stimulation and 
the galvanometric scanning mirrors were synchronized so that each 
pixel was recorded at a known sound phase. This was achieved by trig- 
gering the sound generation on each single frame scan trigger. 

Doing so, each pixel was stroboscopically probed at nStep = 4 dif- 
ferent phases of the acoustic stimulation cycle. As sound propagates 
while scanning two consecutive pixels, the probed acoustic phase is 
shifted by 2m x pixelPeriod, which was taken into account inthe motion 
reconstruction of the imaged structures (Extended Data Fig. 10c). These 
images were then reshaped to yield an (Nx,Ny,nStep) dataset (Extended 
Data Fig. 10b(iv)). 

To analyse the motion of the inner structures of the fish, we used 
Matlab 2019b anda particle image velocimetry toolbox PIVlab®, origi- 
nally developed to characterize the motion of flowing particles for 
fluid mechanics. Essentially, the particle displacement is assessed by 
cross-correlating subregions with decreasing sizes of consecutive 
images (Extended Data Fig. 10b(v)). The contrast of the reflectance 
images was enhanced before the displacement analysis, and the results 
were curated in post-processing by removing outliers and interpolat- 
ing detection gaps. 

The motion detection yielded x- and y-displacement maps at each 
of the four phases in the acoustic stimulation period. The first Fourier 
component was computed for each pixel to extract the amplitude and 
phase of the local displacement (Extended Data Fig. 10b(vi)). The phase 
was finally corrected for the accumulating phase offset along the hori- 
zontal x direction due to the line scanning procedure (Extended Data 
Fig. 10c). Owing to the synchronization of the acoustic stimulation 
with the line scanning process, we could carry out this measurementin 
several planes and obtain a consistent volumetric complex map char- 
acterizing the motion response of the various inner structures to the 
acoustic stimulation. Maximum-amplitude projections across planes 
delivered the shown two-dimensional phase maps, one for motion 
along the speaker-speaker axis (x) and one for motion orthogonal to 
the speaker-speaker axis (y). 


Sound stimulation waveforms 

We reasoned that D. cerebrum sense pressure and particle motion. 
Hence, our sound stimuli were defined in terms of three quantities: pres- 
sure, x acceleration and y acceleration, which were delivered to the fish’s 
current position by utilizing the frequency responses of speakers to 
cancel position-dependent reverberations (see the section of the Meth- 
ods entitled Calibration and reverberation cancellation). y acceleration 
was always kept at zero, and only pressure and x acceleration were 
varied. In summary, 12 sounds were generated from a recorded pres- 
sure waveform and presented to the fish in a random sequence upon 
trigger events. The 12 sounds consisted of four single-speaker sounds 
(left or right x positive polarity or negative polarity), two sounds with 
only a pressure component (positive polarity or negative polarity), two 
sounds with only horizontal x-motion components (positive polarity 
or negative polarity) and four trick conditions, which exactly matched 
the four single-speaker target waveforms, but differed by the speakers 
that were active to realize these. 
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We observed that D. cerebrum startle when we drop a cylindrical 
piece of rubber into the water. We recorded the pressure waveform of 
this sound, high-pass filtered it at 100 Hz, and extracted a12-ms snippet 
to serve as our pressure waveform template (note that conditioned 
sounds—that is, the actual speaker signals—were band-pass-filtered 
between 200 Hz and 1,200 Hz; see the following section). The target 
pressure amplitude was set to a peak sound pressure level of 167 dB 
(referenced to 1 Pa) by rescaling this pressure waveform accord- 
ingly. This amplitude was loud enough to elicit startles reliably and 
still supported by our small 2.7-cm-diameter speakers. The first peak’s 
rise time (10% to 90% absolute amplitude) was 0.664 ms andthe centre 
frequency of the pulse was about 780 Hz. The target horizontal particle 
acceleration waveform was computed from the pressure waveform 
using monopole theory for each Fourier component, as follows. 

The pressure signal decays as 1/r with radial distance r away froma 
sound monopole with amplitude p, at distance rq 

ikr 
plr,t)= Pye 


iot 


and with freguencyf, w = 21f, wavenumber k = 211/A, wavelength A and 
speed of soundc= Af. 

In a medium of density p, the radial particle velocity decays guad- 
ratically with distance in the near field (Kr «1, limit dependent on 
frequency): 


ôr, o- 2.(1* cp) 


By contrast, particle acceleration—the temporal derivative of parti- 
cle velocity—decays quadratically with distance for nearby sounds 
(r <1, limit independent of frequency): 


A HU ll |s 
â,(r,t)=-iwû,(r, t) = a G ik ptr, t) 


To compute the particle accelerationa,(r, t) ata distancer to a sound 
monopole with pressure p(r, t) for discrete signals of arbitrary wave- 
form, we applied this equation separately for each Fourier component. 
Given a pressure waveform {p,}:= p,,P,, ::, Py- With N samples p,, 
spaced at T=1/sr with sample rate sr, the particle acceleration 
{a„} := 40,4, =, Ay- that would be observed at a distance r = rọ from a 
sound monopole was calculated by carrying out the discrete Fourier 
transform {P} := P,P, +++, Py-1 


N-1 


i2mt 
= - LL 
P= ) P,e N 
n=0 


and deriving particle acceleration for each Fourier component 
{A} := Ap,A;, «++, Ay- independently. With corresponding frequencies 
f, > /(NT),suchthat k = 27l /(NTc), and the relationship between pres- 
sure and particle acceleration, A,, is calculated as 


1/1 , 
m=z i 


which defines the radial particle acceleration through the inverse Fou- 
rier transform: 


= 1 xr i^'Ujn 
i > A, ew 
io 


In the experiments, rọ was set to 3 cm, thus simulating a monopole 
sound source at 3 cm, irrespective of D. cerebrum's relative position to 
the speakers. This resulted in a peak particle acceleration of 7.59 ms”. 


Other parameters used were c= 1,500 m s”, p = 1,000 kg m° and 
sr= 51,200 Hz. In terms of pressure, x acceleration and y acceleration 
(p, a,and a,), there were eight different target configurations, with ‘+’ 
indicating polarity of the template waveform and ~’ indicating oppo- 
site polarity: four monopole configurations (+,+,0), (—,—,0), (+,-,0) and 
(-,+,0); two pressure configurations (+,0,0) and (-,0,0); and two motion 
configurations (0,+,0) and (0,-,0). Despite a total of eight target con- 
figurations, there were 12 sound configurations as the four monopole 
configurations can be realized in two ways, either with a single speaker 
or with three speakers (trick configuration; see the next section). 


Calibration and reverberation cancellation 

Conducting experiments in small tanks presents challenges as both 
tank geometry and the receiver’s position affect the sound amplitude 
and waveform sensed by the receiver (Extended Data Fig. 2c). By record- 
ing the speakers’ impulse responses inside the inner tank in terms of 
pressure and particle acceleration (Extended Data Fig. 2b), speakers 
could be activated to precisely control pressure and particle accelera- 
tion components at the fish’s location (Extended Data Fig. 2d). 


Pressure and acceleration measurements. Pressure. Pressure was 
measured with a hydrophone (Aquarian Scientific AS-1, preamplifier: 
Aquarian Scientific PA-4, acquisition: NI-9231 sound and vibration 
module, National Instruments; Extended Data Fig. 1d). During repeated 
playback of the same sound, a single hydrophone was automatically 
moved across a 5 x 5 grid inside the inner tank, sampling with a spacing 
of 1.5 cm (Extended Data Fig. 1c,e). Hence, a 25-point pressure field was 
obtained for each sound configuration, spanning a 6 cm x 6 cm square 
at the centre of the inner tank between the speakers. 
Acceleration. Particle acceleration was measured in two ways. 

In the first method, particle acceleration was measured indirectly 
through the pressure gradient. Newton’s second law of motion (pres- 
sure gradient force) 


links the spatial pressure gradient to particle acceleration. In water, 
with density p = 1,000 kg m”? and speed of soundc=1,500 ms 1, the 
following approximation holds for pressure signal frequencies 
f< 100 kHz, if the pressure gradient is sampled with step size 
Xx-x- 1.5 cm: 


ar 1 pop) -p%) 
“<p, xx 

The approximation holds for all frequencies used in this experiment. 
For measuring gradients, moving a single hydrophone is preferred over 
a hydrophone array, as the gradient could be biased by small differ- 
ences in hydrophone sensitivity and perturbations of the sound field 
by the presence of other hydrophones. We calculated x and y accelera- 
tion on the basis of the 25-point pressure field recorded with a single 
hydrophone. The pressure field included points outside the trigger 
zone to compute pressure gradients (that is, acceleration) across the 
trigger zone boundary. 

Inthe second method, particle acceleration was additionally directly 
measured along all three axes with an acceleration sensor (Triaxial 
ICP - Model 356A45, PCB Piezotronics, acquired with NI-9231 sound 
and vibration module, National Instruments; Extended Data Fig. 1d). 
Like the hydrophone, the acceleration sensor was moved across all 
5 x 5grid positions during repeated playback of the same sound, giving 
measurements for x, y and z acceleration. 

Whereas hydrophones are manufactured and calibrated for under- 
water use, the particle acceleration sensor is not made to measure 
particle acceleration underwater and is meant to be glued onto the 


vibrating object. Owing to an acousticimpedancemismatch between 
metal and water, we expected the PCB sensor to underestimate particle 
acceleration. 

We compared xand yacceleration waveforms for both measurement 
methods and found that the acceleration waveforms acquired through 
the direct method match the waveforms acquired through the indirect 
method after multiplication by a factor of about 2.4. The close matchin 
rescaled waveforms confirms the validity of the gradient approxima- 
tion in the indirect method. 

Hence, in all experiments, x and y acceleration were measured 
through the indirect method, on the basis of spatial pressure gradi- 
ents. The particle acceleration sensor still proved useful in measuring 
the vertical z acceleration in our setup. 


Impulse response-based sound targeting. To create the same sounds 
at any position inside the inner tank, impulse responses for all 4 speak- 
ers were measured across 25 positions on a 5 x 5 grid with 1.5-cm spac- 
ing. In the following, the sound targeting method is described for one 
position. 

Let k; „be the pressure impulse response kernel, k; a „be the xaccel- 
eration impulse response kernel, and k, ay bethey acceleration impulse 
response kernel for the ith speaker. Using M speakers with signal s;, 
pressure and acceleration can be predicted through convolution (s): 


In the Fourier domain, utilizing the convolution theorem, these 
become a system of equations for each Fourier component l. 


M-1 
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On the basis of the Fourier components of the target waveforms (see 
the section of the Methods entitled Sound stimulation waveforms), P, 
Ax, and Ay A and the Fourier components of the impulse response 
kernel K,, ,.K, a, ,and K, a wl the system of equations can be solved for 
the Fourier components ofthe speaker signals S; ;as long as M>3and 
the kernel components are non-zero and non-identical. The 
time-domain signal for the ith speaker is then given by the inverse Fou- 
rier transform using components S, ,. 

Toincrease robustness of the solutions (for example, to avoid speak- 
ers cancelling themselves unnecessarily and to limit speaker ampli- 
tude), speaker signal waveforms were forced to become similar to the 
target waveform. This was implemented by solving the system of equa- 
tions with a least-square solver (scipy.optimize.Isq_linear) with bounds 
=B; 1< S1 < B; „ The bound B; ,;was computed asa rescaling of the abso- 
lute Fourier components of the target pressure waveform P, 


B,= a; YIPi| 


in which y is fixed and scales pressure to voltage and a; is a rescaling 
parameter set independently for each speaker to give additional con- 
trol over active speakers. We list our values for a, used in different sound 
configurations in Supplementary Table 2. 


After conditioning, all computed speaker signals were band- 
pass-filtered between 200 Hz and 1,200 Hz to avoid activating the 
lateral line. 

To ensure that the trick configuration differed from the single- 
speaker configuration only by selective pressure inversion, atwo-step 
sound conditioning was carried out. First, the speaker signals for the 
single-speaker configuration were calculated. Then, these signals 
were effectively fixed to closely resemble the single-speaker signal 
and only activations of the two speakers along the orthogonal axis 
were conditioned. 

The above calculation was carried out for the 25 grid positions. The 
computed speaker signals accurately delivered the target waveforms 
to the target position (Extended Data Figs. 2d and 3 and Supplementary 
Table 2). To ensure consistency over experiments, the water level was 
kept at precisely 10 cm, and the pressure and acceleration fields inside 
the inner tank were measured several times (this includes before the 
first recording and after the last recording). 

During the experiment, the fish’s x-y position was detected at 15 Hz, 
and the loading for the speakers was linearly interpolated on the basis 
of targeted sounds at neighbouring grid positions. 

In the section entitled Sound stimulation waveforms, we describe 
how we defined the pressure and particle motion target waveforms 
that were conditioned this way. 


Estimation of binaural cues (P-ILD, M-ILD, P-ITD, M-ITD) 

ILDs. To estimate binaural cues in our behavioural experiment, 
we analysed the pressure and particle motion at sound calibra- 
tion grid points 3 cm apart, (x9) 1.5 cm to the left and (34) 1.5 cm to 
theright of the centre grid point. To estimate sign and peak amp- 
litude of level differences, we calculated P-ILD (pressure ILD) as 
max,(abs(p(Xo, t))) - max,(abs(p(%, t))) and M-ILD (particle motion 
ILD) as max,(abs(a,(X%, t))) - max,(abs(a,(x;,t))). The level differ- 
ences between these two points were divided by a factor 50 to esti- 
mate the level difference across the left-to-right inner ear axis of 
the fish (about 0.6 mm). Comparing the single-speaker configura- 
tion with the trick configuration, these data show that the sign of 
M-ILD remains the same (+0.11 m s? versus +0.30 ms~), but the sign 
of P-ILDis inverted (+4.4 Pa versus —4.6 Pa). Fora geometrical illustra- 
tion of the inversion of P-ILD, see Extended Data Fig. 8d. 


ITDs. ITDs were estimated by calculating the phase propagation in 
different sound configurations under a monopole approximation 
(Extended Data Fig. 8c-f). 


Reporting summary 
Further information on research design is available in the Nature Port- 
folio Reporting Summary linked to this article. 


Data availability 

Fish trajectory, micro-CT and vibrometry data have been depos- 
ited in the G-Node repository at https://gin.g-node.org/danionella/ 
veith_et_al_2024. Downsampled segmentation mesh-grid data have 
been deposited to MorphoSource at https://n2t.net/ark:/87602/ 
m4/623999. Source data are provided with this paper. 


Code availability 


Code used for data analysis is available on GitHub at https://github. 
com/danionella/veith et al 2024. 
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Extended Data Fig. 1| Setup design and sound calibration equipment. 
a, Photo of the setup. b, Schematic of the setup indicating size and material. 
The room and water temperature is kept at 27 °C; the room light is indirect light 


reflecting from white walls. Infrared (IR) light illuminates the setup from below. 


c, Schematic of sound calibration points inside the inner tank overlaid over 
aheatmap of fish positions inside the inner tank across all 65 recordings. 
D.cerebrumavoids the black walls and oscillates between the two white walls. 
Pressure and particle acceleration were measured at 25 points (x) onagrid 
spanning a 6 cm x 6 cmsquare, covering the trigger zone (green dotted 
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rectangle). Sounds were conditioned to deliver target waveforms to the fish’s 
swimming position within the trigger zone based on each speaker’s pressure 
and acceleration impulse responses for these 25 points. Points outside the 
trigger zone were measured to calculate pressure gradients across the edge of 
the trigger zone, see Methods. d, Left: Hydrophone, Aquarian Scientific, AS-1. 
Right: Triaxial acceleration sensor, ICP® - Model 356A45, PCB Piezotronics. e, 
The motorized arm moved the sensor to each grid position. Left: pressure 
measurement (was also used to compute acceleration indirectly). Right: direct 
acceleration measurement with the PCB sensor. 
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Extended Data Fig. 2 | See next page for caption. 
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Extended Data Fig. 2 | Sound targeting cancels reverb effects. a, Schematic 
of sound calibration points (x) inside the inner tank. Green dotted rectangle: 
trigger zone. Orange dotted square: 9 measurements and target locations 
showninc-d.b, Impulse response kernel for the left speaker at the center 
calibration point for three sound cues: pressure, x acceleration, and 
yacceleration. Pressure and x/y acceleration impulse responses were recorded 
for each speaker and each position on the grid, totaling to3 x 4x 25 =300 
waveforms capturing the acoustic properties of the tank. c, Naive playback 
paradigm (not used): The same waveform is played back via the left speaker, 
and sound cues are recorded at nine positions. Left: constant speaker signals. 


Right: Pressure and x/y acceleration measured at nine positions, spanning a 
4.5 cmx4.5 cm square at the center of the inner tank, vary considerably across 
this area, and the measured waveform does not resemble the playback 
waveform. d, Sound conditioning paradigm (used): Based on the impulse 
responses, different speaker signals are calculated for each target position. 
Left: speaker signals are different for each target position. Top and bottom 
speakers help to cancel y acceleration. Right: Pressure and x/yacceleration 
measured at the same nine positions. The waveforms resemble the defined 
target waveforms and are stable across positions. 
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Extended Data Fig. 3 | Accuracy of six exemplary sound stimuli. a, Schematic 
ofsound calibration points (x) inside the inner tank, indicating the sound 
target position and recording position for speaker signals and recordings 
shown in b. b, Pressure and acceleration measurements for sound 
configurations i) - vi) (see Fig. 2c for schematic), targeted for the inner tank’s 
center calibration point and measured at the same position. Target waveforms 
(target) are defined as pressure, x accelerations, and y accelerations. Peak 


Tine [ms] 


pressure: 223.87 Pa, peak x acceleration: 7.59 m/s’. The speaker signals (top 
panels in i-vi) are calculated based on impulse responses to deliver the target 
waveforms to the target position with the constraint that, e.g., the right 
speaker is inactive in the left speaker configurations (i & ii) andin the left 
speaker trick configurations (vi). For a given speaker signal, convolution with 
the respective impulse response kernels predicts (prediction) the measured 
waveforms. 
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Extended Data Fig. 4 | Startle detection and dynamics. a, Fish translational 
speed, shown for all n = 3798 playbacks across twelve stimulus configurations, 
and across 65 fish aligned to playback trigger att = O ms (thin lines). The mean 
speed across playbacks is shown as a thick line. To trigger a playback, the fish 
had to swim into the trigger zone. This explains the small increase in mean 
speed prior tot = 0 ms (white arrow). b, We used a 25 ms time window (between 
vertical dashed lines) and classified trials as startle trials ifthe temporal 
average speed across this time window exceeded a threshold of 17 cm/s 
(horizontal line). The remaining ones were classified as non-startle trials. The 
threshold was set such that the average speed across non-startle trials 
increased slightly after playback (black arrow). This reflects a conservative 
choice for startle detection, classifying a few startles as non-startles rather 
than classifying non-startles as startles. c, Top: The average speeds computed 


acrossa25 ms time window before each playback are below the selection 
threshold of 17 cm/s. Bottom: The average speed distribution within the 25 ms 
time window indicated inb, after playback, is bimodal. Hence, startles canbe 
readily identified by a speed threshold. d, Body bend: The average absolute 
angle taken between six edges along the fish’s body axis after playback across 
startles increases sharply. It remains constant if averaged across non-startles. 
Startles are initiated by a strong body bend. e, Centered trajectories for all 
playbacks with startle reaction (n = 1415) across all 65 fish after aligning initial 
fish heading. The first 50 ms of startle responses are shown. f, Normalized 
heatmaps of the fish head position at consecutive times relative to the 
position at the start of the startle response. The top row shows snapshots of 
the trajectories shown ine. After a fast initial displacement to the left or right, 
D. cerebrum continues on a forward trajectory. 
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Extended Data Fig. 5| See next page for caption. 
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Extended Data Fig. 5| Habituation of the startle response. a-b, Startle 
probability of each fish across all sound configurations as a function of the 
number of playbacks triggered by the fish. a, For main experiment (in 
untreated fish). b, For experiment in neomycin-treated fish. c-f, Details on 
habituation, data from main experiment (in untreated fish). c, Startle 
probability (the fraction of playbacks with startles, computed per bin) 
increases with time since the previous playback (green curve, the 95% 
confidence interval is given by Wilson scores). d, Startle probability (the 
fraction of playbacks with startles, computed per bin) decreases throughout 
the recording (green curve, the 95% confidence interval is given by Wilson 
scores). e, Startle probability as 2D binned statistics to show the interaction 
between time in recording and time since the last playback with the number of 
previous playbacks (Nth playback); white: no data. f, Average speeds over the 


course of the recording. Because fish that startle after playback havea higher 
speed before playback (see also Extended Data Fig. 4b), adecreased swimming 
speed could explain the decreasing startle probability over the course of the 
recording shown ind. Average speed is computed across 25 msatatimels 
before sound onset (before playback) and across 25 ms after sound onset (after 
playback), separately for startle and non-startle trials. The confidence interval 
indicates the standard error of the mean. Note the split y-axis. The speed of fish 
before playback stayed constant over the course of the recording. Hence 
habituation is not explained by a decrease in swimming activity over 50 min. 
a-f, Together, startle probability increases if the fish is ina fast swimming state 
and decreases, the less time has passed since the last playback and the more 
playbacks have been played previously (both signs of habituation). 
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Extended Data Fig. 6| See next page for caption. 
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Extended Data Fig. 6 | Directional startles are present for both sexes andin 
individual fish. Centered startle trajectories for single speaker configurations 
and trick configurations, same data as shown in Fig. 2f but separated by sex. 
Both male and female D. cerebrum startle away from single active speakers and 
startle towards the active speaker in the trick configuration. a, Single speaker 
configuration. Left: females (75% of 186 startles away from the speaker in 26 
female fish that startled, two-sided binomial test: p = 9 x10”). Right: males 
(83% of 312 startles away from the speaker in 38 male fish that startled, two- 
sided binomial test: p = 1x10 >>). b, Trick configuration. Left: females (72% of 
107 startles towards the speaker in 23 female fish that startled, two-sided 
binomial test: p = 6 x 10°). Right: males (64% of 224 startles towards the 
speaker in 38 fish that startled, two-sided binomial test: p = 2 x 105). a-b, 
Percentages indicate the fraction of startle displacements into the direction of 
the blue arrow. The heatmaps are normalized and smoothed 2D histograms 


over endpoint positions of the shown trajectories. c, To estimate the directional 
bias in individual fish, we selected fish with at least one startle in both the single 
speaker condition and the trick condition (N = 46) to compute the number of 
fish with a mean directional bias away from the speaker (bias > 0.5): 41 0f 46 fish 
in the single speaker configuration, two-sided binomial test: p = 4.x10°°, and 

10 of 46 inthe trick configuration, two-sided binomial test: p = 2x10 *. Hence, 
individual fish escaped away from the single left-right speaker but approached 
itin the trick condition. d, The mean directional bias computed across fish with 
>10 startles in both conditions (N = 5, a subset of a) amounted toa directional 
bias of mean + std. = 84% + 13% away from the speaker in the single speaker 
configuration and to 72% + 9% towards the speaker in the trick configuration. 
c-d, Line thickness is proportional to the number of minimum startles in 
response to one of the two sets of stimuli. Blue line: average directional bias 
across fish directional biases. 
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Extended Data Fig. 7 Data summary on all twelve stimuli for both cohorts. Percentages indicate fraction of startles to the right. P-values report the 
Centered startle trajectories and displacement heatmaps for each stimulus probability that startles are unbiased in any direction (Two-sided binomial test, 
used inthe experiments, startle trials are pooled across all tested fish. a, for null hypothesis: p = 0.5, Bonferroni-corrected: n= 12 stimuli). 

fish without an ablated lateral line. b, for fish with an ablated lateral line. a-b, 
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Extended Data Fig. 8 | Sound monopoles and sound configurations. a, 
Pressure level (dashed line) falloff next to a sound monopole at several phase 
snapshots of a propagating wave. Left: Falloff of al kHz wave over 1.5 m. Right: 
The falloff across the width of the fish at 3 cm distance to a monopole sound 
source stems from the level falloff with distance. b, Both amplitude ratio and 
relative phase between pressure and particle velocity change in a distance- 
dependent manner. Both sound directions (-x, x) stay separate in the relative 
phase between pressure and particle velocity. c-f, Results of a simple model 
used to illustrate level and phase of pressure and motion along the horizontal 
x-axis of the inner tank. The idealized speaker activations in the different sound 
configurations are modeled as sinusoidal monopole sound sources (see 
pressure equation in b) located 6 cm away from the origin with frequency 

f- 780 Hzand speed of sound c - 1500 m/s. Acceleration is calculated from the 
spatial pressure gradient along the x-axis. The top rows are phasor 


representations of pressure or motionat five positions along the horizontal 
axis as indicated in the left cartoon. ILDs and ITDs are computed across a 
distance of 600 pm centered at the origin. See also Fig. 4 for a summary on ILD 
and ITD across sound configurations. c, Inthe single speaker configuration, 
P-ILD, P-ITD, M-ILD, and M-ITD could be interpreted as rightward cues by the 
fish. M-ITD is even smaller than P-ITD as motion phase propagates slower than 
pressure phase in the near field. d, Inthe trick configuration, P-ILD and P-ITD 
are inverted, while M-ILD and M-ITD remain unchanged as compared tothe 
single speaker configuration. c-d, See Methods section on interaural cues for 
comparable P-ILD and M-ILD measurements in our setup. Note that we model 
monopoles in open water here, but the actual speakers are extended pressure 
sources ina tank. e-f, In both the pressure configuration and the particle 
motion configuration P-ILD, P-ITD, M-ILD, and M-ITD are zero or undefined. 
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Extended Data Fig. 9 | Directional behaviour is present after lateral line 
ablation. a-d, Same as in Fig. 2c-f but for lateral line ablated (LLA) D. cerebrum 
a, Illustration of six different stimulus configurations. The cartoons of 
resulting traces illustrate the pressure and particle velocity relationship, 
idealized for a plane wave scenario. b-d, Centered startle trajectories after 
sound playback. b, Directional escapes away froma single speaker for a positive 
polarity waveform (66% escapes, two-sided binomial test: p = 0.0003 across 
n=142startlesin 67 fish with startle trials in this sound configuration) and 
negative polarity waveform (69% escapes, two-sided binomial test: p =1x 10° 
acrossn = 164 startles in 69 fish with startle trials in this sound configuration). 
c, Absence of directional bias in the positive polarity condition with only 
pressure (51%, two-sided binomial test: n.s.; p > 0.05) and only particle motion 
(48%, two-sided binomial test: n.s.; p> 0.05). See Extended Data Fig. 8 for 
opposite polarity results. d, Single speaker playbacks pooled over both 
polarities evoke a directional escape (67% escapes, two-sided binomial test: 
p=1x10~ across n=306 startles in 72 fish with startle trials in this sound 
configuration). Selective inversion of pressure polarity by an additional 
opposing pair of speakers along the orthogonal axis inverts the relative 
polarity between pressure and particle velocity. This implements the trick 
condition, in which the fish is tricked into performing startles approaching the 
active speaker (68% approaches, two-sided binomial test: p = 2 x 10° across 


n=271startles in 73 fish with startle trials in this sound configuration). b-d, 
Percentages indicate the fraction of startle displacements into the direction of 
the blue arrow. The heatmaps are normalized and smoothed 2D histograms 
over endpoint positions of the shown trajectories (gray for single stimulus 
data, blue for pooled data). e-f, D. cerebrum stained with DASPEI, imaged under 
an epifluorescence microscope. Images are constructed from several fields of 
view (gray dotted lines). Scale bar: 2 mm e, In untreated fish, neuromasts are 
visible as green dots (see exemplary white arrows). f, After neomycin 
treatment, no neuromasts are visible, confirming lateral line ablation. g-h, 
Functional indicators of lateral line ablation. g, Neomycin-treated fish have 
significantly more wall contacts within the secondary escape trajectory, i.e. 
within -500 ms (independent one-sided t-test p =1x 10%, after blind manual 
classification of the first ten startles in each recording into startles with and 
without wall contact. Some fish had less than ten startles, hence black dots are 
not always multiples of 10%). h, Left: In the light, untreated control shoals of 
mixed-sex adult D. cerebrum did not capture significantly more Artemia than 
neomycin-treated shoals (n = 4 shoals in each condition, independent one- 
sided t-test, p = 0.17). Right: In the dark, untreated shoals captured more 
Artemia (p = 0.024, independent one-sided t-test, n = 4 shoals in each 
condition), possibly using their intact lateral line sense. g-h, vertical gray stripe 
on violin plot indicates quartiles of data. 
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Extended Data Fig. 10 | See next page for caption. 
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Extended Data Fig. 10 | Method for extracting 2D phase maps of tissue 
motion with laser scanning confocal reflectance microscopy. a, 
Experimental setup (see Methods section on vibrometry for details onthe 
acoustic stimulation system). b, Illustration of phase map extraction: (i) A 
single bead is located within the field-of-view and (ii) oscillates in the horizontal 
x direction during acoustic stimulation. (iii) The bead is imaged witha laser 
scanning microscope, with each pixel being acquired at a different time. Line- 
scan and acoustic stimulation are synchronized to probe the bead at four 
different phases (blue: O, red: 11/2, yellow: n, green: 37/2). (iv) Data are reshaped 
to reconstruct the full movie of the bead motion. (v) The bead displacement is 
computed using across-correlation-based algorithm (see Methods). (vi) The 


amplitude and phase of the first Fourier component of the bead displacement 
are extracted and plotted respectively with hue and color. c, (i) The sound 
phase and consequently the bead displacement phase is drifting when the 
pressure wave propagates along the horizontal x direction. Different motion 
phases are then detected for objects at different locations in the field-of-view, 
although being stimulated by the same sound wave. (ii) This additional phase Y 
is subtracted to yield a phase map with free objects exhibiting the same phase 
for the same sound stimulation. The final phase relationship between various 
objects therefore only depends onthe mechanical properties of the imaged 
structure. 
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Extended Data Fig. 11|2D optical vibrometry in three sound 
configurations. Phase-amplitude particle velocity maps for 1 kHz component 
in response to 1 kHz speaker signals in the single speaker configuration 
(column I), the pressure configuration (column 2), and the particle motion 
configuration (column 3). Depicted velocity amplitudes were scaled up bya 
factor of two in the single speaker condition to more readily compare 
responses to the pressure and motion configurations, where two speakers 
created asignal with approximately twice the pressure and motion amplitude. 
Across all three panels a-c, single-speaker maps resemble the addition of the 


pressure and motion maps. Sound stimulation was along the x-axis in all cases. 
See Fig. 3a for the location of depicted views and Fig. 3c-e for names of 
anatomical regions. a, Tripus tip and scaphium contract along the fish’s medial- 
lateral axis (x) in the pressure configuration but not inthe particle motion 
configuration. b, The scaphium and the sagitta move at different phases along 
the fish’s ventral-dorsal axis (x) relative to surrounding tissue in the pressure 
configuration but not in the particle motion configuration. c, The lapillus 
moves witha phase lag to surrounding tissue along the medial-lateral axis (x) in 
the particle motion configuration but not the pressure configuration. 
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Setup modification for experiment in dark 
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Extended Data Fig. 12 | Excluding rostro-caudal swim bladder mechanism 
and cryptic visual cues. a, To test the hypothesis that the pair of rostral and 
caudal swim bladder could implementa P-ILD or P-ITD sensor along the 
rostrocaudal axis, we filtered for playbacks where D. cerebrum was aligned 
orthogonal to the left-right axis during sound arrival at the fish location. Top: 
all data, same as Fig. 2f. Middle: filtered for +10° alignment to y-axis, Single 
speaker (79% escapes, two-sided binomial test: p = 6 x 10%, n - 183 startle trials 
in N - 59 fish), Trick configuration (72% escapes, two-sided binomial test: 
p=3x107,n=133 startle trials in N = 56 fish). Bottom: filtered for + 3° alignment 
to y-axis, Single speaker (74% escapes, two-sided binomial test: p=1* 10, 

n= 66startle trials in N = 47 fish), Trick configuration (76% escapes, two-sided 
binomial test: p= 8x10 *, n = 45 startle trials in N =45 fish). b-c, Replication of 


L, 


Trick configuration 
[pressure inversion stimuli] 


Single speaker + 
[L+, Rz (flipped)] 


! n=92 


n= 93 
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the experiment in the dark to exclude the possibility of unknown visual cues. 
b, Top view of the modified setup for playback experiment in the dark. Inthe 
dark, the white inner tank walls could no longer prompt D. cerebrum to swim 
orthogonal to the left-right axis. To nevertheless trigger playbacks at the 
center of the tank with the fish being aligned orthogonally within a 45° cone, 
we added an additional constraining tank made from thin transparent plastic, 
thereby increasing the likelihood of triggering playback. c, Centered startle 
trajectory for experiment in the dark. Left: activation of the left speaker leads 
to rightward startles (71%, two-sided binomial test: p = 9 x 10°, n= 92startle 
trials in N = 43 fish). Right: activation of the right speaker leads to leftward 
startles (69%, two-sided binomial test: p = 0.0004,n=93 startle trials in 
N=43 fish). 
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Statistics 
For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex technigues in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


— A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 
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For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 
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Data collection We used custom software written in Python 3 and MATLAB (v2019b) (https://github.com/danionella/veith_et_al_2024) 
as well as LSMAO for laser scanning microscopy (https://github.com/danionella/Ismaq) 
Data analysis We used custom software written in Python 3 and MATLAB (v2019b), as well as SLEAP (v1.3), BIOMEDISA (v23), FIJI (v1.5), 3D Slicer (v5.6) 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors and 
reviewers. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Portfolio guidelines for submitting code & software for further information. 
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All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unigue identifiers, or web links for publicly available datasets 
- A description of any restrictions on data availability 


- For clinical datasets or third party data, please ensure that the statement adheres to our policy 


Fish trajectory, micro-CT, and vibrometry data have been deposited in the G-Node repository at: gin.g-node.org/danionella/veith_et_al_2024 
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groupings 

Population characteristics N/A 
Recruitment N/A 
Ethics oversight N/A 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


X Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size The sample size was chosen based on the standards in the field and estimated behavioral variability. 
Data exclusions No data were excluded. 


Replication We successfully replicated our findings in males (39), females (26), and lateral-line ablated fish (74). Observations shown in Figure 3 c-e were 
repeated once, four times, and once in other fish, respectively, with similar results. 


Randomization There was no group allocation which needed to be randomized. 


Blinding There was no group allocation which would have required blinding. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
l Antibodies ChIP-seq 
l Eukaryotic cell lines Flow cytometry 
l Palaeontology and archaeology MRI-based neuroimaging 
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Ll Dual use research of concern 


Ll Plants 


Animals and other research organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research, and Sex and Gender in 


Research 


Laboratory animals 
Wild animals 
Reporting on sex 
Field-collected samples 


Ethics oversight 


Wildtype adult (> 4 month old) Danionella cerebrum 

No wild animals were used in this study 

Findings apply to both sexes and sexes have been indicated explicitly 
No field-collected samples were used in this study 


All animal experiments conformed to Berlin state, German federal and European Union animal welfare regulations and were 
approved by the LAGeSo, the Berlin authority for animal experiments. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Seed stocks 


Novel plant genotypes 
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Report on the source of all seed stocks or other plant material used. If applicable, state the seed stock centre and catalogue number. If 
plant specimens were collected from the field, describe the collection location, date and sampling procedures. 


Describe the methods by which all novel plant genotypes were produced. This includes those generated by transgenic approaches, 
gene editing, chemical/radiation-based mutagenesis and hybridization. For transgenic lines, describe the transformation method, the 
number of independent lines analyzed and the generation upon which experiments were performed. For gene-edited lines, describe 
the editor used, the endogenous seguence targeted for editing, the targeting guide RNA seguence (if applicable) and how the editor 
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assess the effect of a mutation and, where applicable, how potential secondary effects (e.g. second site T-DNA insertions, mosiacism, 
off-target gene editing) were examined. 
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